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Abstract 



,^ . We study a comprehensive list of quantum codes as candidates for codes used at the physical level 

'"^ ' in a fault-tolerant code architecture. Using the Aliferis-Gottesman-Preskill (AGP) ex-Rec method we 

^O . calculate the pseudo-threshold for these codes against depolarizing noise at various levels of overhead. 

We estimate the logical noise rate as a function of overhead at a physical error rate of po = 1 x lO^''. 

(~| , The Bacon-Shor codes and the Golay code are the best performers in our study. 
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1 Introduction 

A great insight in the early history of quantum computing was that almost perfectly reliable quantum com- 
putation is possible with physical devices subjected to noise as long as the noise level is not too large. 
This observation has given us confidence that, ultimately, it will be possible to build a functioning quantum 
computer. However, the "thi^eshold theorem" that indicates how much noise can be tolerated has not other- 
wise given a very optimistic prognosis for progress. For example, the well-studied seven qubit Steane code 
[[7,1,3]] has a threshold against adversarial noise that is in the O(10~^) range. This level of noise is far 
lower than anything that has been achieved in any laboratory - it is actually a significantly lower eiTor rate 
than many experiments are even capable of measuring. 

In this paper we analyze both previously considered and new additional quantum codes and determine 
their thresholds, logical error rates and overheads using the ex-Rec method developed in [JJ. This allows us 
for the first time to compare the relative merit of many schemes. We will argue how the studied codes could 
serve as bottom (physical-level) codes in a fault-tolerant code architecture that minimizes the overall coding 
overhead. 

In order to cany out this comparative code study we must make some simplifying assumptions. First of 
all, we assume that gates can be performed in a non-local geometry. It is likely that an ultimate quantum 
architecture will be largely restricted to local ID, 2D or 3D geometries, hence the threshold numbers that 
we estimate will be affected by this architecture constraint. As was shown in EllSl, the price for locality 
may be modest for small codes, but it will typically be worse for lai^ger codes 121 since a bigger effort will 
have to be mounted to make the error-correction circuits local. 

Secondly, the noise model for our study is a simple depolarizing noise model with equal probabilities 
for PauU X, Y and Z errors. In particular, we assume that any single qubit location in the quantum circuit 
undergoes a Pauli X,Y or Z eiTor with equal probability po/3- A two-qubit gate undergoes no eiTor with 
probability 1 — po and one of 15 eiTors with probability po/15. The reason for choosing this model is that it 
is the simplest unbiased choice for a noise model given that a comparative study of the performance of codes 
for general noise models (such as superoperator noise) is infeasible computationally. We do not expect the 
performance of these codes to differ greatly if we would choose a biased depolarizing noise model, since 
there are no intrinsic biases in the codes themselves. To deal with biased noise, it may be more advantageous 
to use specific biased code constructions such a non-square Bacon-Shor or surface code or the use of the 
repetition code against high-rate dephasing noise in H. 

We do not establish rigorous threshold lower-bounds by counting the number of malignant sets of faults 
as in |[T]. Instead of counting or sampling of malignant faults, we simply simulate the depolarizing noise 
and keep track of when it leads to failures. We put error bars on our results such that a rigorous lower bound 
on the pseudo-threshold is within this statistical error bar. The level- 1 pseudo-threshold is the value of po 



where the level- 1 encoded error rate pi = po IS- It is only the pseudo-threshold that is of interest in this 
study of bottom codes since we envision, see Section 11.11 that a different code would be used in the next 
level of encoding. 

Our study is in some sense a continuation of the first comparative code study by Steane Q. Our analysis 
goes beyond these previous Monte Carlo studies of quantum fault-tolerance in that it includes more codes 
and focuses on a fault-tolerance analysis of the logical CNOT gate. One of the problems with comparing 
threshold estimates in the literature is that threshold numbers for different codes have been obtained by 
different methods, some more rigorous than other's. We believe that it would be advantageous to stick to 
one clear, rigorously motivated method. The AGP method has the advantage of being tied to a fully rigorous 
analysis lH and the AGP method when used to (approximately) count the number of malignant sets of faults 
can give tight estimates of the (pseudo)-threshold. 

With a few exceptions, we use standard Steane error-correction circuits (and sometimes omit possible 
code-specific optimizations) that allows us to compare codes directly. We discuss these possible code opti- 
mizations and our choices in Section |2l We do not separately analyze Knill's post-selected and Fibonacci 
schemes lH [lOl but we plot Knill's numbers in Figure \T4\ One scheme that is not included is the surface 
code scheme described in Ref. ifTTl . However, we do study surface codes in the original setting of |[T2l using 
Shor error-correction circuits. 

1.1 The Code Architecture 

The usefulness of error correction in computation is roughly measured by two parameters. First is the re- 
duction in error rate that is obtained by using the code; this feature depends on the (pseudo)-threshold of the 
code for the particular noise model and the error correction circuits that are used. The second figure of merit 
is the smallness of the overhead that is incuiTcd by coding. There is a tradeoff that one can expect between 
overhead and logical eiTor-rate that mimics the trade-off between distance and rate of error-con^ection codes, 
see Figure [131 Since error levels of physical implementations are expected to be high, optimistically in the 
range from O(10~^) to O(10~®), it is clear that at the bottom level, optimizing the threshold has priority 
over optimizing overhead. This leads us to consider the following simple code architecture. At the physical 
level, we use a bottom code Cbot which is chosen to have a high noise thi^eshold and a reasonable overhead. 
This paper will be devoted to a comparative study of such codes. We will pick some illustrative numbers to 
argue how one can envision completing the code architecture. We will see that one can find a bottom code 
that maps a base error rate of po = O{10~^) onto a logical eiTor rate of pi = O(10~^) (see Section [5ll. 
To run a reasonable-sized factoring algorithm one may need an logical eiTor rate of, say, O(10~^^)lj. Thus 
one needs a top code Ctop that brings the error rate O(10~^) to O(10~^^). The desirable features of the top 
code are roughly as follows (see also the Discussion at the end of the paper). The top code is a block code 
[[n, k, d\] with good rate k/n in order to minimize the overhead. The improvement in error-rate for a code 
which can correct t eiTors is roughly 

Pi^Poi — ) , (1) 

\PthJ 

where po is the unencoded eiTor-rate and pth is the threshold eiTor rate. Thus in order to get from po = 
O(10~^) to pi = O(10~^^) we could use a code which can correct 4 errors and has a threshold of O(10~^). 
In |[T3l Steane studied several block-codes which may meet these demands. The polynomial codes would 
also be an interesting family to study in this respect. 



'An n-bit number can be factored using a circuit with space-time complexity of roughly SGOn* |20| , so RSA-1024 could be 
broken using a circuit with 0(10^^) potential fault locations. Using different architectures, it may be possible to reduce this to 
O(10") see for example ED- 



2 Preliminaries 

For our study it is necessary to select a subset of quantum codes. We focus on codes that are likely to have 
a good threshold, possibly at the cost of a sizeable but not gigantic overhead. To first approximation the 
threshold is determined by the equation 

Pth = Np\+^ => pth = N-^/\ (2) 

where t is the number of eiTors that the code can conect and A^ is a combinatorial factor counting the sets 
of i + 1 locations in an encoded gate that lead to the encoded gate failing. It is clearly desirable to minimize 
the number of locations and maximize t. This consideration has led us to primarily consider Calderbank- 
Shor-Steane (CSS) codes. Any stabilizer quantum code is CSS if and only if the CNOT gate is a transversal 
gate ifTTl . The advantage of a transversal CNOT is that it minimizes the size of the encoded CNOT; the 
bulk of the CNOT rectangle will be taken up by error correction. This is favorable for the noise threshold of 
Cbot- Secondly, minimizing the error-rate of the encoded gate Cbot(CNOT) will be useful at the next level 
of encoding, because CNOTs occur frequently in EC and their error rates play a large role in determining 
whether error rates are below the threshold (of Ctop)- However to demonstrate that this restriction to CSS 
codes is waiTanted we also consider the non-CSS 5-qubit code [[5, 1, 3]] which is the smallest code that can 
con^ect a single error. We indeed find that this code performs worse than Steane's 7-qubit code [[7, 1, 3]], see 
Section |5] and the Data Tables in Appendix ICl 

2.1 Approximate Threshold ScaUng 

In this section we discuss the global behavior of the noise threshold as a function of block size n, distance, 
and other code properties. Let us consider Eq. ^ and see how we can get the best possible threshold. An 
upper-bound on N is (^ , ^) where A is the total number of locations in the encoded gate (rectangle). Ideally, 
a code family has a distance that is linear- in n, i.e. t is linear in n. Let us assume for simplicity that only 
some fraction of all locations appears in the malignant fault sets of size t + 1, i.e. we model A^ w ( ^™J') 
where Amai < ^- The locations in ^mai are in some sense the weak spots in the circuits; overall failure 
is most sensitive to failure at these locations, ^mai may be either linear or super-linear in the block size n. 
In case j4jnai scales linearly with ?i, and t = 5n for some 5 < 1/4, the threshold in Eq. (|2]) increases as a 
function of n and asymptotes in the limit of large n to a finite value. Indeed, for Amai = o.n and 5 <C a 
(which is typically the case since t < n/4) we get, using Sterling's approximation, 

,. / an \-'/^^"^ 5 r.(^''\ 
Pth = lim = — + O — (3) 

n~^oo \pn + 1J ea \ct / 

Such monotonic increase of the threshold with block-size is clearly desirable. It is also clear that when t 
is constant, for any polynomial ^mai = poly(n), the threshold pth in Eq. Q decreases as a function of n. 
When ^mai scales super-linearly with n and t is linear in n we get the following behavior. First, the threshold 
increases with n (the effect of larger t), then the threshold declines since the effect of a super-linear Amai 
starts to dominate. For codes and EC circuits with this behavior, it is thus of interest to determine where this 
peak threshold performance occurs. We will see some evidence of these peaks in Figure [TO]in Section [S] 

Now let us consider the scaling of A (and A^i^i) in case we use Steane eiTor correction. In Appendix |A] 
we review how we can bound A for a CSS code with Steane error correction, but a rough estimate is that 

A = CiAcnc + C2^vcr + Csn. (4) 

Here Acne is the number of locations in the encoding of the ancillas for eiTor correction, and A^cr is the 
number of locations in the verification of the ancillas for error correction. The additional term linear in n 



comes from the transversal encoded gates and the transversal syndrome extractions. For a CSS code and the 
standard encoding construction (see Appendix lAJ). Acne typically scales as 0{wn) where w is the maximum 
Hamming weight of the rows of the generator matrix of either Ci or C2 in standard form. However this 
standard construction may be sub-optimal, since by bringing the generator matrix in standard form one can 
increase the maximum weight of its rows. 

For Steane-EC the full verification of the ancilla block requires other ancillas blocks; a fully fault-tolerant 
verification would give a pessimistic scaling of ^vcr = 0{wnt). However it is not necessarily desirable to 
have strict fault tolerance as long as the total probability of low-weight faults that produce faults with weight 
t + 1 or more is low, see the discussion in Section |4] On the other hand for increasing ?i the number of 
verification rounds should at least be increasing with ?i, perhaps 0(log n) would be sufficient. If we assume 
that ^mai scales similarly as A, it follows that if we look for linear-scaling Amai we need to look at code 
families which have simple encoders, scaling linearly with n. This seems only possible for stabilizer codes 
with constant weight stabilizers, such as quantum LDPC codes ifTSl and surface codes or for the Bacon-Shor 
codes (which has encoders that use 0{n) 2-qubit gates). 

For the Bacon-Shor and surface codes the distance t does not scale linearly with n (but as \/n). Nonethe- 
less, the work in |[T2l shows that the effective distance for the surface codes does scale linearly with the 
block size, since there are very few incorrectible errors of weight 0{t). For the Bacon-Shor code family, 
where one has less syndrome information, this behavior has not been observed |[T9l (see also Figure |9]). 

For code famiUes with constant-weight stabilizers an interesting alternative to Steane-EC |[33l is the use 
of Shor-EC B3l where the syndrome corresponding to each stabilizer is extracted using a cat state or simple 
unencoded qubit ancillas. As for ancilla verification in Steane-EC, the syndrome extraction needs to be 
repeated to make the circuits more fault-tolerant. In Section \5\ we will see the effect on the threshold of 
using Steane-EC versus Shor-EC for the surface codes Il28ll29l . see Figure [TT] of Section |5] It is striking 
that the surface codes with Shor EC are the only known examples of a code family with a finite n —>■ 00 
threshold. This is despite the 0{ny/n) scaling of the total number of locations A of the Shor error correction 
circuit. 

2.2 Choice of Codes 

The codes that we have studied are listed in Table[T] All codes in this table ai^e CSS codes with the exception 
of the [[5,1,3]] code. Some of these codes have been previously analyzed by Steane in Ref. [S]. There 
exist various families of binary CSS quantum codes; the families are the quantum Reed-Muller codes, the 
quantum Hamming codes, the quantum BCH codes, the surface codes and the sub-system Bacon-Shor codes. 
In our study we consider only a single member of the quantum Reed-Muller family, a [[15, 1, 3]] code, since 
these codes typically don't have very good distance versus block-size ll22l . The [[15, 1,3]] code was first 
constructed in ll23l from a punctured Reed-Muller code RM(1, 4) and its even sub-code. It is the smallest 
known distance-3 code with a transversal T gate. 

We study various quantum Hamming and quantum BCH codes (see a complete list of quantum BCH 
codes of small block-size in ll24ll ) which are constructed from self-orthogonal classical Hamming and BCH 
codes respectively. We have chosen those codes that encode a single qubit and have maximum distance for 
a given block size. We have included the previously studied Bacon-Shor codes and the surface codes in our 
study. We have also included the concatenated 7-qubit code [[49, 1,9]] which we use in the way that was 
proposed by Reichardt in ll25l . see the details in Section [3TT] 

Another family of codes that has been proposed for fault tolerance ll26l are the quantum Reed-Solomon 
codes or polynomial codes. These are codes that are naturally defined on qudits. In this study we consider 
them as candidates for bottom codes. An alternative use is to consider them as top codes where one uses a 
bottom code to map the qubits onto qudits. In our study we assume that quantum information is presented 



Parameters 



Notes 



[[5,1,3]] 

[[7,1,3]] 

[[9, 1,3]], [[25, 1,5]], [[49, 1,7]], [[81, 1,9]] 

[[15,1,3]] 

[[13, 1,3]], [[41, 1,5]], [[85, 1,7]] 

[[21,3,5]] 

[[23,1,7]] 

[[47,1,11]] 

[[49,1,9]] 

[[60,4,10]] 

[[79, 1, 15]], [[89, 1, 17]], [[103, 1, 19]], [[127, 1, 19]] 



non-CSS five qubit code ||27| 

Steane's 7-qubit code (doubly-even dual-containing) fSl 

Bacon-Shor codes [il9;l 

Quantum Reed-MuUer code 12211231 

Surface codes |[28ll29l 

Dual-containing polynomial code on GF(2^) Q 

Doubly-even dual-containing Golay code (cycUc) |[34l 

Doubly-even dual-containing quadratic-residue code (cyclic) Il24i 

Concatenated [[7, 1, 3]] Hamming code ll25l 

Dual-containing polynomial code on GF{2^) Q 

BCH codes, not analyzed IIH 



Table 1 : A list of the codes included in our study. 



in the form of qubits and hence we will consider these codes as binary stabilizer codes. We specifically 
chose to include the [[21, 3, 5]] (a concatenated [[7, 1, 4]]8) and the [[60, 4, 10]] (a concatenated [[15, 1, 8]]i6) 
code from the family of dual-containing polynomial codes over GF{2^), because they are the smallest 
error-correcting polynomial codes in this family. 

We find it impractical to simulate the encoded CNOT gate for BCH codes in this table which have block- 
size larger than [[47, 1,11]], see 14.21 The threshold for these bigger codes will benefit considerably from the 
fact that t/n is quite high. Some semi-analytical values for the thresholds of these codes have been given 
in im. Even with good thresholds, these bigger BCH codes have limited applicability due to their large 
overhead. The bottom code should be picked to obtain a logical eiTor rate that is well below the thi^eshold of 
some good block code but only at the price of a moderate overhead. 



2.3 Universality 

Universality for CSS codes can in principle be obtained using the technique of injection-and-distillation 
llT0l[T4llT5l . Let us briefly review how one may perform fault-tolerant computation for CSS codes for which, 
of the Clifford group gates, only the CNOT and Pauli operations are transversal. If one is able to perform any 
Clifford group gate trans versally, including H and S, it is well known how to obtain a universal set of gates 
ll30l . Note that a CSS code with only its transversal CNOT gives us the ability to fault-tolerantly prepare 
the states {[+), |0)} and perform transversal X and Z measurements. However we do not necessarily have 
a fault-tolerant realization of the Hadamard gate H. 

In this case the problem of constructing fault-tolerant single qubit Chfford gates can be reduced to the 
problem of preparing the encoded \+i) oc |0) + i|l) ancilla OTTl . In particular, the gates S oc exp(— i7rZ/4) 
and Q oc exp(+i7rX/4) generate the single-qubit Clifford group and can be implemented given a |-i-i) 
ancilla, see Figure [T] and Figure |2l 
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Figure 1: The S gate using a \-\-i) ancilla. 

An encoded |-i-i) ancilla can be produced using the method of injection-and-distillation |[T5l[T4l . The 
distillation procedure for distilling an unencoded |-i-i) from seven unencoded \-i) is shown in Figure [3l In 
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Figure 2: The Q gate using a \+i) ancilla. 
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Figure 3: A circuit for the distillation of a |+i) ancilla from seven noisy |-i) ancilla's using the [[7,1,3]] 
Steane code. Steane error-correction (see also Figure |5]l is performed on seven unencoded qubits |— i). 
Based on the X and Z syndromes of the Steane code the en^ors are coiTccted, except that the X eiTors are 
con^ected using Y operators. The last step is to decode the Steane code to yield a single |+i) ancilla. The 
circuit can be found by starting from a deterministic distillation procedure for |+), applying an S gate to 
the output, and conjugating it back to the input. The order of error corrections is important since the second 
CNOT must be replaced by a controUed-Y before the order can be reversed. 

order to perform this distillation procedure in encoded form, one can generate an encoded noisy ancilla \-i) 
using Knill's idea of injecting a state in the code. The distillation circuit is then performed in encoded form. 

3 Error Correction Circuits 

Locations in a quantum circuit are defined to be gates, single-qubit state preparations, measurement steps, 
or memory (wait) locations. After one level of encoding, every location (denoted as 0-Ga) is mapped onto a 
a rectangle or 1 -rectangle (1-Rec), a space-time region in the encoded circuit, which consists of the encoded 
gate (1-Ga) followed by eiTor coiTcction (1-EC), as shown in Fig.lH For ti"ansversal gates, the 1-Ga consists 
of performing the 0-Ga's on each qubit in the block(s). 



1-Ga 



1-EC 



Figure 4: A 1-rectangle (1-Rec), indicated by a dashed box, which replaces a single-qubit 0-Ga location. 
The 1-Rec consists of the encoded fault-tolerant implementation of the 0-Ga (1-Ga) followed by an error 
correction procedure (1-EC). 



For the fault-tolerance analysis one also defines an extended 1-Rec or ex-Rec which consists of a rectangle 
along with its preceding l-EC(s) on the input block(s). Let us now discuss the circuits for en^or connection. 

Steane error correction for CSS codes (Steane-EC) is schematically shown in Figure |5] The |0) and 
1+) ancilla blocks in Fig. |5]can be prepared in the following way. First n qubits are encoded using circuits 
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Figure 5: Steane's error correction method for CSS codes involves coupling two encoded and verified an- 
cilla's to a block of data qubits. The ancilla qubits are then measured in the Z- or X-basis and the syndrome 
s is determined. From the syndrome s the corresponding Z or X enor is determined and the data qubits are 
corrected. 

derived from the generator matrix of a classical coset code of C^, see Appendix |A]f or details and definitions. 
The memory locations in the encoder aie determined using Steane's Latin rectangle method |[32i . discussed 
in more detail in Appendix lA. II Then the ancillas pass through a verification circuit. This error detection 
circuit measures the X and Z stabilizer generators of the encoded state some number R of times. For 
a |o) ancilla each round is given by the circuit in Figure [6] For dual-containing codes, the Hadamard- 
conjugate of the circuit is used for a |+) ancilla. If we detect any eiTors in any of the R rounds, the encoded 
state is rejected. Otherwise the state is accepted and used for syndrome extraction. We will consider L 
preparation attempts per ancilla and in our studies we will vary the parameters R and L, giving rise to 
different overheads. 

$ 



|0) ( z-basis 



X-basis 



0) — $^-( Z-basis 



Figure 6: The ancilla verification circuit for one round of error detection. First, the top encoded ancilla is 
verified against X-eiTors and it is determined whether Z = +1. If no X-eiTors are detected and Z = +1 the 
encoded ancilla is then verified against Z-en^ors using an ancilla I O) which is itself verified against X-en^ors. 



3.1 Specific Code Considerations 

Specific properties of a quantum code can often be used to simplify the enor-correcting circuits. This section 
discusses each family of codes and the optimizations we have implemented or the code properties that have 
been used to modify the EC and 1-Ga circuits. 

In general, we have opted to focus on the error-correcting properties of the codes rather than the pos- 
sible simplifications to the Steane-EC network. One of the reasons for this approach is that it is not clear 
whether verification circuits that perform the minimal number of checks are superior to verification circuits 
that perform many thorough tests. Furthermore, changes to the network are difficult to parameterize and 
systematically study because there are many possible choices and few are clearly the best. In addition, we 
believe that the overall trends observed in this paper are not altered by omitting these optimizations. 

Reichardt has suggested a generic optimization that uses different encoders for each logical ancilla in the 
verification circuit |[34l . This optimization can reduce the number of necessary rounds of verification and 
possibly decrease the probability of correlated enors at the output of the verification circuit, conditioned on 
acceptance. We do not use this optimization for any of the codes in this study. 

The Steane and Golay codes are constructed from perfect classical codes. Perfect codes have the property 
that every syndrome locates a unique error of weight u; < t. As Ref. Qj observed, some parts of the 
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en^or detection circuit can be removed for a CSS code constructed from perfect classical codes and the 
construction remains strictly fault-tolerant. Again we do not use this optimization. 

For the Bacon-Shor codes we don't use Steane's Latin rectangle encoding method, but rather the simpler 
method described in I19il . We do use the standard verification method for the bigger BS codes and not the 
simpler verification method in 1191 . 

For the surface codes we consider both Steane-EC and Shor-EC to understand their effects on the thresh- 
old. We use Shor-EC using bare ancillas as in |[T2l . This is fault-tolerant for surface codes on a 5 x 5 
lattice or larger as long as the syndrome measurements are repeated enough times. The number of repeated 
measurements could in principle be varied, but we choose to repeat the measurements I times ior a £ x i 
surface code, following |[T2l . 

The [[49, 1, 9]] concatenated Steane code is one of the CSS codes in our study whose network deviates 
from the construction described in the previous section. The preparations of |o) and |+) do not include 
a verification circuit. Instead each 7-qubit block has an error detection step after each [[7, 1,3]] -encoded 
logical gate |[34l . A 49-qubit ancilla is rejected if any of these error-detections detects an error. This implies 
that any single fault will be detected, so the circuit is fault-tolerant. In fact, any pair of faults is also detected, 
so that a third order event is necessary to defeat the error-detections. This way of using [[49, 1, 9]] is the 
one which Reichardt proposed. Unlike in his simulations we restrict ourselves to a finite number of ancilla 
prepai^ation attempts L, since we care about the total overhead. 

The polynomial codes that we consider are non-binary codes over 2"^ -dimensional qudits. We can choose 
the parameters of these codes so that when we consider each qudit as a block of m qubits, the Fourier 
transform and controlled-SUM gates are implemented by bitwise application of Hadamard and CNOT, re- 
spectively. In this setting, the code is simply a binary CSS code encoding m qubits which is constructed 
from a non-binary dual-containing classical code by concatenating using a self-dual basis of GF(2™). The 
advantage of such a construction is that we can decode the syndromes as if they were vectors over GF{2"^), 
allowing us to conect more higher-order eiTors than we could otherwise coiTcct as a binary code. To use this 
advantage, we do not need to change the way we construct the rectangles at all, only the way we interpret 
the classical measurement outcomes. 

[[5,1,3]] is the smallest distance-3 quantum error-con^ecting code and it is a perfect quantum code. 
Gottesman has shown how to compute fault-tolerantly with this code fV7\, and there have been some numer- 
ical studies of logical error rates using Shor-EC [1161 . To our knowledge the threshold for this code has never 
been determined. Unfortunately, there are no two-qubit transversal gates for [[5, 1, 3]], so it is necessary to 
construct a two-qubit gate from a three-qubit gate such as the T3 gate. The gate T3 is defined by the fol- 
lowing action on Pauh operators: XII -^ XYZ, IX I -^ YXZ, IIX -^ XXX, ZII -^ ZXY, IZI -^ 
XZY, HZ —>■ ZZZ. This gate is a Clifford gate that can be combined with stabilizer-state preparations 
and transversal Pauli measurements to yield any gate in the Clifford group ifTTl . Specifically, CNOT, S, and 
Cyc gates (and their inverses) can be constructed from the T3 gate in this way. Here Cyc= SHSH acts as 
X — > y ^ Z — > X. The fault-tolerant implementation of T3 is shown in Figure [T] 

The [[5, 1, 3]] construction also differs from other CSS constructions because we use Knill (or teleported) 
eiTor coiTcction (Knill-EC) lITOl . In our study we will simulate the extended T3-rectangle assuming that the 
logical Bell pairs of Knill-EC are perfect. We do this since it is simpler and shows that even using perfect 
logical Bell pairs the threshold is not very good, see Section [5] For [[5,1,3]] the R and L parameters are 
replaced by NC and NB, denoting the number of cat state preparation attempts per Pauli measurement and 
the number of logical Bell state preparation attempts per error coiTcction, respectively. A circuit to prepare 
and verify encoded Bell pairs for Knill error correction for [[5,1,3]] is shown in Figure [T6]in Appendix IB] 

The construction for the [[15, 1, 3]] Reed-Muller code is entirely standard. Since this code is not con- 
structed from a dual-containing classical code, the lo) and |+) encoders are not simply related by a transver- 
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Figure 7: The encoded implementation of T3 (with an additional permutation of the blocks qi,q2, qs) using 
the gates CNOT, S, Cyc and Y (and inverses). Each gate in the circuit is applied transversally. The circuit is 
only a logical operation after completing all of the gates, i.e. CNOT and S are not valid transversal gates for 

[[5,1,3]]. 



sal Hadamard gate. For the same reason, the code can correct more X enors than Z eiTors. The most 
interesting feature of this code is that T is a transversal gate |[23ll35]| . but this does not enter directly into our 
analysis of the threshold. 



4 The Aliferis-Gottesman-Preskill (AGP) Method 



According to (T] a rectangle is correct if the rectangle followed by an ideal decoder is equivalent to the ideal 
decoder followed by the ideal gate (0-Ga) that the rectangle simulates: 



COlTCCt 

1-Rec 



ideal 
1 -decoder 



ideal 
1 -decoder 



ideal 
0-Ga 



As said before, an extended rectangle (ex-Rec) consists of a 1-Ga along with its leading and trailing error- 
corrections. The extended rectangles make an overlapping covering of the circuit. A set of locations inside 
an ex-Rec is called benign if the 1-Rec is con^ect for any set of faults occurring on these locations. If a set 
of locations is not benign, it is malignant. The design principles of strict fault-tolerance are described in 
pictures in Sec. 10 of |[T1. If these properties hold for the 1-Ga and 1-EC, these gadgets for a [[n, 1, d]] code 
with t = \_{d — l)/2j are called t-strictly fault-tolerant. The important consequence of these conditions 
is that for a [[n, /c,d]] code with t-strictly fault-tolerant constructions one can show that any set of t or 
fewer locations in the ex-Rec is benign. A construction is called weakly fault-tolerant when, for a code that 
can correct t errors, sets of s < t locations can be malignant. Weak fault-tolerance is a useful concept in 
optimizing thresholds since weakly fault-tolerant circuits can be more compact than strictly fault-tolerant 
circuits, hence allowing for fewer fault locations and a potentially better threshold. On the negative side, 
weak fault-tolerance allows some low-weight faults to be malignant but if the number of such faults is small 
then the threshold is not much affected. 

All our fault-tolerant schemes are 1-strictly fault-tolerant implying that single faults can never be malig- 
nant. More precisely, any single fault in a 1-EC or a 1-Ga never propagates to become a weight-2 eiTor in 
a block. In Steane-EC when we prepai^e ancillas with at least two attempts {L > 2) and one en^or detection 
stage {R = 1), we eliminate malignant faults of weight 1. For R = I the EC is not 2-strictly fault-tolerant 
since there may be a pair of faults, one in each of the first two encoders, generating a high weight (possibly 
higher than t) error that passes the error detection circuit undetected. Since the number of these events is 
quite rare, they will not contribute much to the failure probability. It is possible to show that R > t and 
L > i + 1 is necessary for t-strict fault-tolerance for a code that can correct t errors by continuing the same 
argumeno For a specific t-error-correcting code the actual values required for R and L depend on how each 
encoding circuit propagates errors. 



"However, the standard verification stage would need additional error detections on the bottom ancilla pair for R = t and 
L = t + 1 to be both necessary and sufficient for t-strict fault-tolerance. 
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Let us review why the extended rectangle is the central object in a fault-tolerance analysis. An encoded 
circuit where the physical gates (0-Ga) have been replaced by rectangles can also be viewed as an en- 
coded circuit with 0-Ga's with a different error model. This can be achieved simply by inserting perfect 
decoder-encoder pairs between the rectangles, see lUl. In an ex-Rec with malignant faults, the rectangle will 
correspond to a faulty 0-Ga, whereas for benign faults the rectangle will correspond to a perfect 0-Ga. The 
reason that one has to take into account an ex-Rec and not merely a Rec is that faults in the leading 1-EC 
can combine with faults in the 1-Rec to produce malignant faults. For example, a single error in a leading 
1-EC does not cause a failure of the rectangle in which it is contained; this enor however can combine with 
later eiTors to give rise to a logical eiTor. At the same time the presence of a logical error in the leading 1-EC 
which maps one codeword onto another will not affect the failure rate of the rectangle that comes after it 
since the state which enters this rectangle is a codeword. Hence in the extended rectangle method there is 
no double-counting of errors. Instead, it is an efficient method to handle the effect of incoming errors and is 
likely to give a very tight estimate of the threshold if no other assumptions or simplifications are present. 

In principle one may think that one would also need to be careful about the effect of incoming errors into 
the ex-Rec; perhaps an incoming error could combine with a seemingly benign fault in the ex-Rec and give 
rise to an incorrect rectangle. Thus perhaps one has to consider the malignancy of sets of faults given a 
possible worst case input to the extended rectangle. 

However, one can argue for stabilizer codes and for deterministic (to be defined below) eiTor-coiTcction 
that malignancy does not depend on incoming errors to the ex-Rec. To show this, let us first review the 
formalism of stabilizer codes. A stabilizer code is the +1 eigenspace of an Abelian subgroup of the Pauli 
group Pn which contains all n-qubit tensor-products of the Pauli operators {X,Y, Z,I}. The normalizer 
N{S) <^ Pn of S is defined as N{S) = {E\\/s G S,EsE'f G S}. For Pauli operators (which either 
commute or anti-commute with each other) N{S) is the simply the group of Pauli operators that commute 
with any element in S. Any element of N{S)\S is a logical operator mapping codewords onto each other. 
All other Pauli operators P ^ N{S) anti-commute with at least one element in S and map a code word 
outside the code space indicated by a non-zero syndrome. Thus the Pauli group P„ can be partitioned into 
cosets of N{S) and each of these cosets is labeled by a different syndrome. The lowest-weight member of 
each coset is called the coset leader. Standard syndrome decoding finds, for each given syndrome, a coset 
leader with lowest weight and chooses this as the error correction. Thus the low-weight (non-degenerate) 
correctable errors correspond to distinct syndromes whose coset leader corrects the error. For high-weight 
errors Ei, all we can say is that EiEcon-cct G ^{S) since Ei and -Ecorrect have the same syndrome. 

Now let us consider the issue of incoming errors to an ex-Rec and assume the following properties of 
stabilizer eiTor coiTcction. First, we assume that the part of the 1-EC circuit which couples any ancillas to 
the incoming data is deterministic, i.e. does not depend on any incoming error on the data. The choice of 
which ancillas to couple may depend on some error detection or ancilla verification. This property holds 
for many but not all eiTor-correction circuits; it does not hold, for example, when the number of repetitions 
of syndrome extraction depends on the value of these syndromes. This property does hold for the circuits 
used in this paper. Furthermore, given a stabilizer S and the incoming error E^^ on an encoded state, let 
the 1-EC be such that the syndrome of the 1-EC uniquely determines in which coset of N{S) in the Pauli 
group the enor Ei^ lies. In this sense the 1-EC must be complete en^or correction for the code that is 
used. For example, if for a CSS code the 1-EC only does Z error coiTcction whereas X eiTors can map the 
state outside the code space, the syndrome information effectively partitions the Pauli group into cosets of 
N{Sx)Pn{X). Here Pn{X) is the subgroup of Pauli operators that only contain X and / and N{Sx) is the 
normalizer of the stabilizer subgroup Sx with only X and / Pauli operators. In this case the syndrome does 
not uniquely assign the incoming eiTor to a coset of A^(5). Thirdly, upon any incoming eiTor Ein a perfect 
1-EC determines a syndrome that con^ects E'ln modulo a logical error (given by an element in N{S)). This 
is a basic property of stabilizer error correction as described above. 
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Let then the incoming state to an ex-Rec be a state in the code-space of the stabilizer with an additional 
error E-m- We want to show that the state that comes out of the leading 1-EC is again some state in the code 
space with an additional error E'out that only depends on the errors inside the 1-EC, Ecc, i-c E'out = /(^oc) 
where / is independent of Ei^- Any 1-EC circuit for stabilizer codes can be implemented with Clifford gates. 
Given an incoming en^or Ei^ and eiTor inside the 1-EC Ecc, it follows (because a 1-EC for any stabilizer 
code can be implemented with Clifford gates) that the 1-EC has syndrome s{Einhi{Ecc)) where hi is a 
function independent of Ei^. Based on the syndrome the correction step will be some incorrect = ^in^i (^ec) 
mod N{S). Before eiTor coiTcction the data has en^or /i2(£'ec)£^in where /i2(-E'cc) is the part of Ecc that has 
propagated to the data. After en^or correction the data thus has error /i2(-Ecc)^i(^cc) mod N{S). We 
strip off the logical error in N{S) and identify Eo^^t = h2{Ecc)hi{Ecc)- Note that when the EC is not 
deterministic, the functions hi and /12 can depend on E'jn. D 

We discuss the explicit decoding of the error syndromes for each code in Appendix iBl 

4.1 Monte-Carlo Implementation of Method 

Given the AGP method the numerical problem to be solved is whether a Rec is correct given a set of faults 
in the ex-Rec containing it. This set of faults is generated using depolarizing noise with error probability 
Po for each location in the circuit. We calculate the failure rate of the ex-Rec, i.e. the probability that the 
Rec is not correct, for fixed R and L. This implies that sometimes there are no verified ancillas available 
for a 1-EC. If this happens for any of the 1-ECs inside the extended rectangle, we call this a failure of the 
extended rectangle. We do this for all codes except for Reichardt's use of [[49,1,9]]. The reason for this 
exception is that for Reichardt's method the failure rate of ancillas may be rather high. If we let failure 
of having verified ancillas in the leading 1-EC determine failure of the rectangle after this 1-EC, we are 
possibly double-counting eiTors. Hence in Reichardt's method we replace any failed ancillas in the leading 
1-EC by a perfect ancilla and do not call failure. As the results show, even under this assumption, the 
[[49,1,9]] concatenated code with error-detection and finite resources is not a great performer. 

In general, our assumption on the effect of failed ancilla preparations may make our estimates for the 
pseudo-thr^eshold for the EC circuits slightly more pessimistic. 

We will estimate the failure rate of a CNOT ex-Rec, since this is by far the biggest circuit among the 
Clifford ex-Recs. As we argued in Section 1231 the non-Clifford (and possibly other Clifford) gates will be 
implemented via injection-and-distillation so that their implementation will not affect the threshold. Pauli 
gates are not applied within a Clifford ex-Rec because they can be stored in classical memory as the Pauli 
frame and applied only prior to the execution of non-Clifford gates. 

Given a fixed R and L, we will estimate the failure rate pi{po) = — ^ where A'^faii is the number of 
Monte-Carlo samples that fail (i.e. the number of times we simulate the extended rectangle with randomly 
generated faults and observe that the rectangle is incorrect) and A'^ is the total number of runs. With high 
probability this estimated pi lies within one standard deviation of the real pi . In this way we collect data 
points pi(po) for different values of po- We then take these points as the mean of a normal distribution for 
each pq. We sample from these normal distributions and for each set of samples we determine a small degree 
polynomial piipo) fitting the samples. The equation pi(po) = Po gives us a sample of the threshold and we 
put an error bar on this result by calculating the standard deviation of the obtained threshold samples. 

The way we test for correctness of a rectangle for a given pattern of faults in the ex-Rec is as follows: Let 
Eout be the outgoing error of the leading 1-EC. We use syndrome decoding to determine the coset leader 
E'load corresponding to the coset of N{S) in the Pauli group of this -Eout- We propagate this -Biead through 
the rectangle, let /(-Bicad) be the outgoing error on the data. We follow the rectangle by an ideal decoder 
and let incorrect be the correction suggested by the ideal decoder. Then we test whether incorrect /(^'icad) 
commutes with both X and Z. If it does, we infer that no logical faults occurred, hence the rectangle was 
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con^ect. Otherwise we call failure. 

An alternative way of using the AGP correctness criterion is to count or sample malignant fault sets. This 
method is advantageous if one wants to estimate the threshold for worst-case adversarial noise. In such an 
application, one fixes the number of faults and counts or samples how many sets with this fixed number of 
faults are malignant. The failure rate pi is a polynomial in po with factors that ai^e determined using the 
malignant set counts (or estimates of these counts determined by sampling). For codes with large distance 
this method becomes cumbersome, as the total number of possible fault-sets of size t + 1, (^ , j^) becomes 
large. Here A is the total number of locations in the ex-Rec. This sampling method is difficult but still 
possible for the Golay code, but it is not possible for codes of higher distance. 

The advantage of the malignant set counting or sampling method is that one gets an upper bound on pi 
for arbitrary values of po- This makes it possible to estimate pi even for very small values of po- We use 
the Monte-Carlo simulation to estimate pi, but the number of samples required becomes quite large if one 
wants to estimate pi with good relative error for small po. In such cases we extrapolate the values for pi 
obtained from larger values of po, see Section [ 

4.2 Software and Computer Use 



On the website 11361 one can find a set of software tools that have been developed for this and other future 
fault-tolerance projects. The quantum circuits for the CNOT ex-Recs based on CSS codes are highly struc- 
tured and can be mechanically assembled in 0{n^) time for block-size n given the classical codes Ci and 
C2. We have used MAGMA |i37] and/or GAP liMl (using the GUAVA package ||39l) to construct quantum 
codes and compute their parameters. The code stabilizers are copied from the computer algebra programs 
into our circuit synthesis and simulation programs, where they are again verified to have the required com- 
mutation relations. 

The simulation and circuit synthesis programs are implemented in C-i-i- and use MPI BOl for communi- 
cation during embarrassingly parallel tasks. The project is entirely open source and makes use of preexisting 
open source libraries such as a Galois field implementation PTI and a weighted matching algorithm B2l . 
Importantly, the same functions and procedures are used in the Monte-Carlo simulation. This gives us 
increased confidence in the simulation output. 

The symmetries of the pair count matrix for some distance-3 code circuits and the lack of single-location 
mahgnancies in all circuits strongly suggests that our circuit constructions are indeed fault-tolerant against 
single errors. Furthermore, we strictly check all input and intermediate results for consistency at runtime. 
The programs can be optimized and further improved, but we leave this to future work and encourage 
development by making the code publicly available fi36il . 

The simulations were carried out on a relatively small allocation of Blue Gene L at the IBM T J. Watson 
Research Center. Typically we used between 64 and 256 PowerPC 440 700 MHz CPUs. Each pair of 
CPUs had access to 512 Mb of local memory. Using 256 CPUs gave us roughly a factor of 50 speed-up 
over a typical single-processor desktop machine. The entire process of development and debugging took 
many months, but we estimate that all of the data could be retaken in several weeks with these computing 
resources. 

5 Results 

Tables |3l in [21 m and|2]list the complete set of results of our studies. Our results are obtained assuming that 
all locations including memory locations suffer from noise at the same noise rate, unless specified otherwise. 
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Figure 8: Level- 1 depolarizing pseudo-threshold for three families of codes with perfect ancillas for Steane- 
EC: surface codes, dual-containing codes, and Bacon-Shor codes. This plot indicates that under no circum- 
stances can thresholds reach 1% for the codes in our study. The data points are connected by Unes merely 
as a guide to the eye. 

5.1 Perfect Ancillas 

In our first study, and only in this Section, we assume that ancillas for Steane error correction can be prepared 
flawlessly, see Figure [8] In such a scenario, the thixshold is largely determined by the eiTor-coiTcction 
properties of the code (see also the analysis in B4l ). in particular- its (effective) distance. For families of 
quantum error-correcting codes in which the effective distance is linear in the block-size, we expect the 
threshold to be a monotonically increasing as a function of n, see Section 12.11 In Figure [8] and Figure 
|9]we have plotted the pseudo-thresholds for three families of codes: surface codes, some dual-containing 
codes, and Bacon-Shor codes. The surface codes and Bacon-Shor codes apparently have fairly good distance 
properties, even though there is some decline in the BS code family for large ?i. Figure [8] shows we cannot 
expect a threshold over 1% for the codes we have studied using Steane-EC - introducing noise realistically 
into the ancilla preparation circuits cannot increase the pseudo-threshold. Note that if we do Shor-EC on the 
surface codes we cannot expect thresholds exceeding about 3%, see the arguments in |[T2l . 

When we assume that the logical Bell pairs of Knill's circuit can be prepared flawlessly, the level- 1 
pseudo-threshold of the [[5, 1, 3]]'s T3 gate is (2.0 it 0.1) x 10~^|j. This is roughly an order of magnitude 
below the Steane code with perfect ancillas. 



''The pseudo-threshold in this case is the point at which the failure rate pi of a T3 ex-Rec is the same as the base error rate po 
of all elementary gates in the ex-Rec. 
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Figure 9: Level- 1 depolarizing pseudo-threshold for surface codes and Bacon-Shor codes using perfect 
ancillas for Steane-EC. 

5.2 Pseudo-Thresholds 

In Figure[lO]we tabulate for each code the maximum pseudo-threshold over the various choices of R and L. 
The maximum overall pseudo-threshold (2.25 ± 0.03) x 10~^ is attained by the Golay code with L = 30 
and R = 1. The two code families, Bacon-Shor and surface, both attain a peak threshold and then decline 
when we use Steane-EC. The peak Bacon-Shor code is the [[49, 1, 7]] at (1.224 ± 0.005) x 10"^ with L = 9 
and R = 1. The peak surface code (using Steane error correction) is [[41, 1, 5]] at (1.008 ± 0.008) x 10"'^ 
at L = 30 and R = 1. Interestingly when we use Shor-EC for the surface codes the performance is quite 
different. Shor-EC does not do as well as Steane-EC for small block sizes, but for larger block size Shor-EC 
gives a threshold that asymptotes to a finite value, see Figure [TT] For small block size the thresholds of the 
surface codes are not as good as of some other codes such as the Golay code and the Bacon-Shor codes. 

It is clear from the data that the pseudo-threshold increases with increasing L. Our main interest in this 
study is in circuits with small overhead and hence with a relatively small number of preparation attempts L. 
In various cases the thresholds stated for finite L will be thus be lower than the one in the L —>■ oo limit. 
Notably, this occurs for the [[49, 1, 9]] code, where we expect thresholds approaching 1 x 10~^ with many 
more ancilla preparation attempts |[25l . In other cases one can take the perfect ancilla results in Figure [8] and 
the Tables as upper bounds on the L — > oo pseudo-threshold. 

The use of weakly fault-tolerant circuits, i.e. small R and small L is meant to get improved thi^eshold 
behavior for finite resources. The best performance of a code can be expected for L — > oo, since one would 
always use ancillas which passed verification. When L — > oo, it is clear that, -at least below threshold-, 
optimal performance is likely to be achieved when R is taken as large as possible, since then an ancilla is 
maximally verified. However, at finite L, a larger R will let more ancillas fail and hence increase the chance 
for an extended rectangle to fail (remember that we, pessimistically, call an extended rectangle failed if 
we don't have ancillas for EC). Hence for small L, small R, weakly fault-tolerant verification circuits can 
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Figure 10: Level-1 depolarizing pseudo-threshold versus block size. The other codes are the [[5,1,3]] 
non-CSS code, the [[15, 1,3]] Reed-Muller code, the [[49, 1,9]] (dual-containing) concatenated Steane code 
using L = 15 attempts to prepare using error detection at level-1, and the [[60,4,10]] (dual-containing) 
concatenated polynomial code using L = 20 attempts to prepare ancillas. 

outperform circuits with the same L and larger R. 

5.3 Influence of Storage Errors 

In Figure [12] we replot the pseudo-threshold versus block-size when storage error rates (on memory loca- 
tions) ai^e zero. The peak pseudo-threshold increases to (3.33 it 0.02) x 10""^ . The Figure shows that storage 
errors do not influence the pseudo-threshold appreciably. The Bacon-Shor codes are least affected by stor- 
age errors because the encoding circuits are extremely simple. The non-CSS [[5, 1,3]] code is most greatly 
affected because storage errors can enter into the T3 gate sub-circuit, the |0) encoders, and the cat-state 
encoders at many locations. 

5.4 Logical Error Rate versus Overhead 

The threshold is an extremely important figure of merit for fault-tolerant circuit constructions. But prac- 
tically speaking, we ai^e also interested in how quickly the error rate decreases if the initial enor rates are 
low enough for a given overhead. Figures [131 [Ml and [151 plot the probability of failure of a CNOT ex-Rec 
(defined in Section \47\\ versus the number of physical CNOTs in a rectangle at po = 10"^. Even though 
there are other measures of overhead, such as total number of qubits involved in the rectangle or the depth 
of the rectangle circuit, we have chosen the number of CNOTs per rectangle as an estimate for the overhead 
since it approximately captures the total size, i.e. depth times width, of the rectangle. 

The Golay code achieves the lowest logical eiTor rate for codes with fewer than O(IO^) CNOT gates per 
rectangle, and that rate can be further reduced by increasing the number of verification rounds to i? = 2. 
There is a clear tradeoff between the number of physical CNOTs per rectangle and the logical error rate. 
We note that given the lack of code specific optimizations, the achievable overheads for various codes may 
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Figure 11: Surface code level- 1 depolarizing pseudo-threshold versus £ for £ x ^ surface code (the block-size 
n = €' ^ [i — 1)^). The ex-Rec is a transversal CNOT gate with i sequential Shor-EC steps per EC. The 
pseudo-threshold increases with i and is expected to approach a constant value in the limit of large i, unlike 
the other codes in this study. 

be somewhat less than what is estimated here. For the Golay code and the Bacon-Shor codes for example, 
the overhead may come down by at least a factor of 2 by using simplified verification circuits. Viewed an 
a log-scale such decrease in resources is relatively small. We also see in Figure [14] that the approximate 
expression for the failure rate, Eq. ([T|), gives a pretty good estimate of the actual failure rate. 

Some of the eiTor rates plotted in Figure [13] were extrapolated from eiTor rates at higher values of po and 
may only be rough indications of the actual eiTor rates. For small values of po the logarithm of the eiTor 
rate pi(po) is expected to be approximately linear in po- We extrapolate from a least-squares fit to this line. 
Tables|3]|4] and[5]indicate these extrapolated rates by enclosing them in square brackets. The extrapolations 
are only plotted for the 5x5 surface code and the 9x9 Bacon-Shor code and are plotted without eiTorbai^s 
for these two points [j. 

For the Golay code we have looked at the behavior of the threshold for i? = 1, 2, 3. One important 
empirical observation is the following. The pseudo-threshold can increase slightly while the logical error 
rate for po = 10~^ remains the same. This happens for the Golay code when R = \ and L is increased 
from 10 to 20. Furthermore, the pseudo-threshold can decrease while the logical error rate decreases too. 
This also happens for the Golay code when L = 10 and R is increased from 1 to 2. This suggests that 
the pseudo-threshold value is sensitive to higher order effects that quickly become negligible at lower error 
rates. Thus a desired logical error rate may be achievable with significantly fewer ancilla resources L than 
are necessary to maximize the pseudo-threshold, provided the initial error rate po is not too close to the 
pseudo-threshold. 

In Figure [14] we have also added Knill's C4/C6 Fibonacci scheme iflOl at 2 and 3 levels of encoding. 



Unfortunately, using the same method by which we obtain error estimates for our calculated pseudothresholds, we find error 
estimates that are an order of magnitude larger than these extrapolated values. In the few places where we make these extrapolations, 
the results should only be taken as rough indications of the actual error rates the codes can attain. 
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Figure 12: Pseudo-thresholds versus block size for Steane-EC and Knill-EC circuits, comparing the case 
where the memory failure rate equals the gate failure rate with the case where the memory failure rate is 
zero. Naturally the difference is smallest where we have taken advantage of simple encoders as those for the 
Bacon-Shor codes. 

These data points are derived from his paper B At level 2 the detected error rate of the logical CNOT is 
(1.06 ± 0.01) X 10~^ and at level 3 the detected eiTor-rate is (2.18 ± 0.02) x 10~^. 

The plot shows that [[9,1,3]] is still better than the C^/Cq scheme in terms of overhead, but the C^/Cq 
Fibonacci scheme definitely beats [[7,1,3]]. The next two Bacon-Shor codes fill a void between C^/Cq level 
2 and C^/Cq level 3. 

For the surface codes (see Fig. [TS] ) we note that the error rates are relatively high compared to other 
error-correcting codes with comparable numbers of CNOTs per rectangle. However one should remember 
that the circuits for the surface codes are already spatially local in two dimensions whereas the circuits for 
any of the other codes, for example, the Golay code, are not. 

6 Discussion 

In our study we have considered bottom codes and their performance in a bottom-top code architecture. Our 
best threshold around 2 x 10~^ is seen for the Golay code, and many other codes both larger and smaller 
were studied and found to have much worse thresholds. An important figure of merit is the logical error rate 
versus overhead curve which shows that the Bacon-Shor codes are competitive with Knill's C^/Cq scheme 
at a base en^or rate of 10^^. 

We have seen that the constraint of finite resources, i.e. limited R and L in Steane EC, can considerably 
and negatively impact noise thresholds. An example is Reichai^dt's estimate for [[49, 1, 9]] when L ^ oo 



Note that his error model is shghtly different from ours but we take the dominating physical CNOT error rate to be the same. 
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Figure 13: Level- 1 logical en^or rate (probability of failure of a CNOT ex-Rec) versus the number of CNOTs 
per rectangle. The line connects the points of the best performing codes. Points with the same shape (color 
online) belong to the same code but have different circuit parameters. The error rates are evaluated at a fixed 
Po = 10"'^. The results C4/C6 scheme of |[TOl are shown for comparison. 



versus our estimates for this construction at small L. For code families with low-weight stabilizers, Shor 
EC may give rise to thresholds which grow with block-size. For code families which do not have this 
property, e.g. general quantum BCH codes, the limit resource constraint on R and L and the complexity of 
the encoding circuits start pushing the thresholds down beyond some peak performance block-size. 

In this landscape of codes and their performances, one of the missing players is the surface code scheme 
of (TT\ in which many qubits are encoded in one surface code and the CNOT gate is done in a topological 
manner. In principle, the possible advantage of this scheme is that if one uses enough space (meaning 
block size) one would reach the asymptotic threshold of a simple EC rectangle (no 1-Ga). We have in fact 
analyzed an ex-Rec where the Rec is only Shor-EC on a £ x £ surface and we find that this asymptotic 
memory threshold for £ ^ oo is about 3.5 x 10"'^. This is a factor of two lower than the number stated in 
lOn . For finite block size one could analyze a CNOT ex-Rec for this topological scheme just as for the other 
codes. Like all the other codes, the topological scheme will have a trade-off between overhead and logical 
error rate. It will be interesting to see whether topology and block coding provide an efficient way of using 
resources and how it compares to a local version of a bottom-top architecture discussed in this paper. 

For a bottom-top architecture it will be important to study the performance of top codes in order to 
understand at what error rate one should switch from bottom to top code and what total overhead one can 
expect. Concerning a choice of top code we expect the following. First of all, given the constructions of 
ll45l . one can expect that a [[n, k, d]] block code has a threshold comparable to a [[n, 1, d\] code. Secondly, 
the networks in |i45] show how to do logical gates on qubits inside the block codes using essentially gate- 
teleportation and Knill-EC. One issue of concern for block codes is the complexity of the encoding circuit 
as a function of block size. It would be highly desirable to consider block codes with EC circuits that are 
lineal" in n, otherwise one would expect the threshold to decline as a function of n. 

There is another desirable property of top codes which relates to the transversality of gates. In order to 
minimize overhead, it is desirable that the T gate is transversal for the top code. The reason is as follows. 
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Figure 14: Level-1 logical en^or rate (probability of failure of CNOT ex-Rec) versus the number of CNOTs 
per rectangle for the best performing codes. The subset of data plotted here was chosen so that the eiTor 
rate decreases monotonically with the rectangle size and there is no code with lower error rate at a given 
rectangle size. The error rates are evaluated at a fixed po = 10~^. 

In order to have maximal freedom in picking a bottom code we will only require that it has a transversal 
CNOT. Thus all other gates, in particular T = e^'^^l^ and the phase gate S, should be either performed 
by more complicated fault-tolerant 1-Ga or be implemented by the injection-and-distillation scheme. If 
the fault-tolerant circuits for these non-transversal gates have poorer thresholds than the CNOT gate, then 
the injection-and-distillation scheme is the preferred solution. In the injection-and-distillation scheme, the 
obtained en^or rates of the encoded and distilled ancillas will be limited by the noise rates on the Clifford 
gates which distill the ancillas, since the Clifford distillation circuit is not fault-tolerant. Assume we teleport 
the ancillas into Ctop ° Cbot and get Clifford gates with O(10~^^) error rate. Since a circuit such as Bravyi- 
Kitaev distillation uses 0(10'^) gates, the error rates of the distilled ancillas can be as high as O(10~^^). Thus 
by these schemes the, say, T eiTor rate is always trailing the transversal gate eiTor rates. But assume that the 
T gate is transversal for the top code and thus we only inject the T ancillas into Cbot- Then even though 
the once encoded gate Cbot(T) has an error rate of, say, O(10~^), the twice-encoded gate Ctop ° Cbot(T) 
will mostly likely have an error rate similar to other Clifford gates since there are very few Cbot(T) in the 
twice-encoded gate compared to the EC parts. Of course, the top code will have other non-transversal gates; 
for example the [[15,1,3]] code has a transversal T gate but not a transversal Hadamard gate. If the bottom 
code has a transversal Hadamard gate, we can implement a fault-tolerant H in Ctop ° Cbot by using the 
fault-tolerant non-transversal gadget for the Hadamard gate in Ctop and implementing the resulting Clifford 
gates. This shows that there are possible constructions which would allow all gates needed for universality 
to be implemented with approximately the same, low, error rate, while the noise threshold of such scheme 
is determined by the noise threshold for the transversal Clifford gates. 
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Figure 15: Level- 1 logical enor rate versus the number of CNOTs per rectangle for the i x i surface codes, 
i = 5,7, 9. It is expected that the error rate decreases exponentially as i increases for fixed po = 10~^. 
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A Various Aspects of Steane-EC 

A binary [[n, k, d]] CSS code CSS(Ci, C2) is constructed using two classical linear error correcting codes 
C2 ^ Ci and has the codewords: 



Yj \c + a) where a G Ci/C;^. 

C2 I ceC;^ 



(5) 



Each row r of the parity check matrix of C2 gives the stabilizer generators X{r), and each row s of the 
parity check matrix of Ci gives stabilizer generators Z{s), where U{r) = U\^ (8> • • • (8) U"^"- It is easy to 
check that \a) is a simultaneous eigenstate of these stabilizer generators: (1) a row r of the parity check 
matrix of C2 must be an element of C^, so adding it to each codeword in the superposition \a) leaves the 
state unchanged, and (2) every codeword in the superposition \a) is an element of Ci, so it must pass the 
parity checks of Ci. A basis of the 2^ cosets of C^ in Ci corresponds to logical X operations X{a) on the 
code space because X{a) |0) = \a). Similarly, a basis of the 2^ cosets of Ci in C2 corresponds to logical 
Z operations Z{h) since Z{h) \a) = (—1)^'* \a). We can choose these bases such that the logical operators 
obey the commutation relations of the fc-qubit Pauli group. 

A special case of the CSS construction occurs when C2 = Ci, in which case Ci is a dual-containing 
code. The X and Z stabiUzer generators have identical supports and the Hadamard H gate is transversal. 
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If in addition the weight of each stabilizer generator is a multiple of 4, Cj*- is called doubly-even and the 
quantum code has a transversal S gate. The code does not have any transversal gates outside of the Clifford 
group B6ll in this case. 

Steane en'or correction for a CSS code C55(Ci, C2), C^ C Ci uses |+) and |0) ancilla states. These 
states can be encoded directly from the generator matrices of Ci and C2, respectively, according to a well- 
known procedure. The generator matrix G has n columns and ki rows for Ci orn — /c2 rows for C^, and the 
quantum code encodes k = ki + k2 — n qubits. Gaussian elimination puts a generator matrix into standard 
form G = {I\A) where / is an identity matrix and A is a binary matrix. The ith row of the generator matrix 
specifies the controls and targets of Wi CNOT gates, where Wi is the weight of the row minus one. In the next 
section we discuss how to implement this circuit in a way that minimizes the number of memory locations. 
The depth of the resulting CNOT circuit is w = maxjiOj}, assuming equal cost for any pair of qubits to 
communicate. The number of fault locations in an encoder is summarized by the following expressions: 

^enc("'5 ^1, ^2, w) < n + wma.x{ki,n — /C2), no memory noise 
Ae„c{n, ki,k2,'w) < n + wn, memory noise. 

For particular- states, different scaling is possible. For example, for the Bacon-Shor codes one can make the 
encoded ancillas using 0{n) 2-qubit gates. In general, any unitary stabilizer circuit has an equivalent circuit 
with O(n^) gates and O(logn) depth P8l . 

One method of verifying the encoded ancilla against low-weight correlated errors is to use transversal 
gates to perform error detection. One possible enor detection method consumes three additional ancilla and 
uses 3 transversal CNOT gates and 3 transversal measurements. The cost of verifying is: 

Ays^{n, ki, k2, w, R) < i?(3Aenc(?^5 ki, k2,w) + 6n), no memory noise 
Avsr{'>T',ki,k2,w,R) < R{3As„c{n,ki,k2,w) + 6n) + n, memory noise. 

Again, these expressions assume equal cost for any pair of qubits to communicate. 

Finally, we can write expressions for the total number of fault locations in a CNOT extended rectangle 
using Steane error connection: 

A{n, k,w, R) < 8Af.nc{n, ki,k2,'w) + 8j4ver(^, ^1, ^2i w, R) + 17n. 

If we set R = t then the total number of fault locations is A{n, ki, k2,w, R) = 0{wnt) using this method 
of error correction and assuming equal communication costs between qubits. In the worst case this can be 

0(n3). 

A.l Latin Rectangle Method for Optimizing Encoding Circuits 

There is a simple method for minimizing the number of memory locations in an ancilla encoding circuit 
due to Steane. Steane puts the generator matrix G of a linear binary code into standard form {I\A) using 
Gaussian elimination. An encoding circuit for the logical zero state can be constructing by looking at the A 
matrix for the code 6*2^. Every 1 in the A matrix gives a CNOT gate in the encoder. The control qubits are 
the Is in the / part of G and the target qubits are the Is in the A part of G. 

For example, we have G = (1010101,0110011,0001111) for the [7,3,4] code, which is the C^ for 
Steane's [[7,1,3]] code. Transposing columns 3 and 4 gives the standard form and an A = (1101, 1011, 0111). 
This means there are 9 CNOT gates in the logical zero encoder. We can assign each CNOT a time-step so 
that no qubit is involved in two gates at once. That constraint makes a time-step assignment the same as 
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finding a partial Latin rectangle. The Latin rectangle to complete is 



? ? ? 

? ? ? 

? ? ? 



and one possible 



1 2 3 

2 3 1 
3 1 2 



. The circuit con^esponding to the time-step assignment is: 



completion is 

# time 1 
cnot 1, 4 
cnot 2, 7 
cnot 3, 6 

# time 2 
cnot 1, 5 
cnot 2, 4 
cnot 3, 7 

# time 3 
cnot 1, 7 
cnot 2, 6 
cnot 3, 5 

We have to undo the qubit permutation that occuned in the Gauss elimination to standard form, so at the 
end we should switch back qubits 3 and 4. This is the smallest depth (3) that a circuit for A can have. The 
smallest depth is the maximum row or column sum w of A. 

The problem of completing the Latin rectangle and therefore of computing the optimal time-step assign- 
ment for a matrix A is equivalent to edge coloring a bipartite graph with the minimum number of colors. We 
construct the graph in the following way. The left set of vertices corresponds to the control qubits. The right 
set of vertices corresponds to the target qubits. A control and target vertex are connected by an edge if there 
is a CNOT between those two qubits. Assign a color to an edge to indicate what time-step we plan to do that 
CNOT gate. Since we cannot have two CNOT gates occur at the same time using the same qubit, all of the 
edges incident to a given vertex must have different colors. This means that a valid schedule corresponds 
to an edge coloring of this bipartite graph (bipartite because we have a set of control vertices that are only 
connected to target vertices, and a set of target vertices that are only connected to control vertices). By Hall's 
theorem ll32l . there is a coloring using w colors, and w colors is the minimum number of colors we can use. 
Here w is maximum weight of the rows of A minus 1. See BTl for an algorithm that finds an edge coloring 
with w colors in time O(niVcNOT)- Here n is the number of qubits that are to be encoded (i.e. number of 
vertices) and A'^cnot is the number of CNOT gates in the encoder (i.e. number of edges). 

We have tested several of the encoders produced by our Latin rectangle software tool with and without 
memory locations by simulating them using CHFo EH- 



B Syndrome Decoding 

General algorithms for constructing the classical circuits to decode measurement outcomes obtained in 
Steane error correction require exponential time and/or space. Therefore, we consider each code's syndrome 
decoder separately, essentially devising a special-purpose algorithm for each to make the decoding feasible. 
Table |2] lists all of the codes we consider in this study and their syndrome decoders. There are six 
distinct decoding algorithms that we use to compute the error locations and type of error from the syndrome 



"CHP stands for CNOT, Hadamard, Phase. 
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Code 



Decoder 



[[5,1,3]] 

[[7,1,3]] 

[[9, 1,3]], [[25, 1,5]], [[49, 1,7]], [[81, 1,9]] 

[[15,1,3]] 

[[13, 1,3]], [[41, 1,5]], [[85, 1,7]] 

[[21,3,5]] 

[[23,1,7]] 

[[47,1,11]] 

[[49,1,9]] 

[[60,4,10]] 



Table Lookup 

Table Lookup (cyclic) 

Majority 

Table Lookup 

Min. Wt. Matching 

Table Lookup (cyclic) 

Table Lookup (cyclic) 

Algebraic Ell 

Table Lookup with Message Passing 

Table Lookup (cyclic) 



Table 2: The decoders that we use for the codes in our study. 



measurements: a generic table lookup algorithm, a table lookup algorithm for cyclic codes over arbitrary 
fields, a majority voting algorithm for Bacon-Shor codes, a minimum weight matching algorithm for surface 
codes, a simple message passing algorithm for the concatenated Hamming code, and an algebraic decoder 
for the [[47, 1,11]] quadratic residue code. 

Rather than use a general table-lookup algorithm, we use a so-called Meggitt decoder which uses the 
fact that the polynomial codes and the Hamming, Golay, and quadratic residue (QR) codes are constructed 
from cyclic classical codes. Cyclic codes have a compact description in terms of a generating polynomial 
whose coefficients give one of the code words and whose cyclic shifts generate a basis for the code. The 
Meggitt decoding algorithm stores a table of syndromes and their associated error coiTcctions |[50l . For non- 
binary codes such as the polynomial codes, the table stores both eiTor locations and eiTor-type (the so-called 
amplitude). Only (J^i) syndromes need to be stored for a weight w error, since one of the coordinates 
can be fixed by the cyclic symmetry. Finding the appropriate recovery requires at most n table lookups. 
If we fail to find a recovery in the table, a subroutine is triggered that applies some syndrome-dependent 
correction mapping the state back into the code space. 

For cyclic codes with lai^ger distance where table lookup is impractical, for example [[47, 1, 11]], alge- 
braic decoding techniques can be used. The generator polynomial's roots are used to compute a sequence of 
syndromes from which we can locate errors. BCH codes are easy to decode because their generator poly- 
nomials have a contiguous sequence of roots so the Berlekamp-Massey algorithm can find the en^or-locator 
polynomial whose roots give the eiTor locations. Sometimes decoding up to the full minimum distance of 
the code is challenging because the generator polynomial may not have a long sequence of roots, so some 
syndromes are missing and the Berlekamp-Massey algorithm cannot be directly applied. In this case, un- 
known syndromes can sometimes be computed from algebraic equations involving the known syndromes. 
Algebraic decoding of the [[47, 1, 11]] proceeds this way. For each error weight from zero to t, we compute 
any missing syndromes, construct a polynomial whose roots are the eiTor locations, and find the roots of 
the polynomial. If the polynomial has enough roots, we correct those errors and stop. If we do not find 
enough roots for each of the locators, we return a "failed" result, triggering a subroutine that applies some 
syndrome-dependent correction that maps the state to some (possibly logically incon^ect) state in the code 
space. The implementation details can be found in P9l . 

The Bacon-Shor codes are essentially concatenated quantum repetition codes. Since the code stabilizer 
is preserved by bitwise Hadamard composed with a 90 degree rotation of the square lattice, one syndrome 
decoder is sufficient for both X and Z error correction. Imagine a vector of n^ syndrome bits placed on 
an 71 by n square lattice. Let Sx be the syndrome vector for X eiTors and s^ be the syndrome vector for 
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Z errors. Let R be the map on vectors of length r? that rotates them by 90 degrees on the square lattice. 
The same syndrome decoder is applied to s^ and Rs^^. The syndrome decoder decodes a variation on the 
classical repetition code on n bits. First, the decoder computes the parity of each column of the lattice and 
stores each column parity as an element of a vector p. Next, the decoder computes the repetition code parity 
check h = Hp. This paiity check H is expressed in standai^d form [I„_i 1] where 1 is the all ones column 
vector. Finally, the decoder infers the error locations from the parity check. If the weight of the parity check 
is greater than t, we must assume that the rightmost bit of p was incorrect so that /i ® 1 gives the error 
locations on the first n — 1 bits of p. Otherwise, we infer that the rightmost bit of p was correct so that h 
gives the en^or locations on the first n — 1 bits of p. 

The surface code is decoded using Edmond's minimum-weight matching algorithm. The approach differs 
slightly depending on whether Steane-EC or Shor-EC is used but is essentially the same as 1121 . Steane-EC 
gives a 2D matching problem whereas Shor-EC gives a 3D matching problem. The mapping from syndrome 
information to a matching problem is as follows. 

Nonzero syndrome bits are called defects and are located somewhere in the £ x i plane. We construct 
a complete weighted graph whose vertices represent defects and whose edge weights indicate the distance 
between defects. The surface code's syndrome may be such that there are lone defects which are not caused 
by error patterns connecting two defects, but by an error pattern connecting an edge-defect on the boundary 
to an inner defect. X and Z en^ors constitute separate matching problems and X-defects can be matched 
with, say, the horizontal boundaiies and Z-defects with the vertical boundaries. 

We can design an algorithm for decoding the surface code for, say, Z errors, as follows: 

• Imagine cutting the lattice vertically in two halves, left (L) and right (R). Let Ni/ff{i) be the number 
of defects in row i of the left/right part of the lattice. For each row of the lattice, add Ni/ji{i) edge 
defects on the ith row on the left (right) boundary. 

• Assign the weight of the edges between any edge defects as zero and assign the distance as the weight 
between edge defects and inner defects. 

• Compute the minimum-weight perfect matching of the graph of defects. 

• The recovery operation consists of applying phase flips on the qubits that are along the edges of each 
pair of matched vertices in the graph. 

Note that the algorithm enforces the property that the graph has an even number of vertices, so that every 
vertex can be matched. 

The concatenated [[7,1,3]] code, that is, the [[49,1,9]] code, can be decoded to distance 7 if we treat it as a 
concatenated code. However, decoding the code to distance 9 requires a slight modification of the algorithm 
so that a simple message is passed from level- 1 to level-2. 

Suppose the 49 transversal measurement outcomes are organized into 7 registers of 7 bits each. We use 
these registers as temporary storage to compute the appropriate correction. First, we compute the level- 1 
syndromes for each register as we would normally do. These syndromes indicate errors Cj in the ith level- 1 
register We coiTcct each level- 1 register according to the CjS and "flag" those registers for which Cj 7^ 0. 
Next, we compute the level-2 (logical) syndrome of the resulting 49 bit register, which now has trivial level- 
1 syndrome in each 7 bit register This level-2 syndrome indicates a logical correction e that is constant on 
each level- 1 register (but two level- 1 registers can take different values). The correction ci := (0^ Cj) © e 
corrects all eiTors of weight 4 or less, except for one problem case. This case occurs when a pair of errors 
occurs in one level- 1 register and another pair of en^ors occurs in a different level- 1 register. The problem is 
overcome by comparing the register positions where e is 1 with the positions of the flags whenever two flags 
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Figure 16: A fault-tolerant circuit for preparing logical Bell pairs for Knill eiTor correction of [[5, 1,3]]. 
The sub-circuit ED^ measures the stabiUzer of |o), {XZZXI, IXZZX, XIXZZ, ZXIXZ, ZZZZZ), 
using 4 and 5 qubit cat states, and the sub-circuit ED makes the same measurement without measuring 
Z = ZZZZZ. If any measurement outcome is nonzero, the Bell state is rejected. The sub-circuit M-^x 
measures XX = X®'^^ using a 10 qubit cat state. The Bell state is rejected if the XX measurements 
disagree, but if they are both 1 then Zi is applied to the output Bell pair. The cat states are verified so that 
if a cat state is accepted then a single fault in its preparation cannot produce a correlated error. 

are raised. If they disagree, apply the correction C2 := (0j Cj) © / where / is a logical correction on the 
flagged registers. Otherwise, apply the original correction ci. This procedure corrects all eiTors of weight 4 
or less and returns the input to the codespace in all cases. 

The classical decoding algorithms have been tested exhaustively for all of the codes in the paper except 
for the large Bacon-Shor and surface codes. The algorithms were found to correct all errors of weight t 
or less. In the case of the polynomial codes, t is the number of errors the underlying nonbinary code can 
correct. The same decoding algorithms that were tested exhaustively for small Bacon-Shor and surface 
codes were used for the larger codes in those families. 



C Data Tables 



[[n, 


k,d]] 


LI 


Ri 


CX/RecI 


PliPmcm = 0,po = 10 ") 


Pl(Pmcm =P0 = 10 ^) 


Pth(Pmem = 0) 


Pth(Pmcm = Po) 


5,1,3] 

[[5,1,3]] 
[[5,1,3]] 
[[5,1,3]] 
[[5,1,3]] 


2 

3 

5 

10 

10 


2 
3 
5 
3 
10 


2,160 

5,117 

14,775 

18,536 

60,760 


- 


- 


(3.9 ±0.7) X 10-5 
(9.2 ±0.5) X 10-5 
(9.2 ±0.5) X 10-5 
(8.8 ± 0.5) X 10-5 
(8 ± 3) X 10-5 


(2.5 ±0.4) X 10-5 
(3.7 ±0.3) X 10-5 
(3.3 ±0.6) X 10-5 
(4.3 ±0.3) X 10-5 
(3.0 ±0.6) X 10-5 


7,1,3] 
[[7,1,3]] 
[[7,1,3]] 
[[7,1,3]] 


2 
3 
4 
5 




519 

775 
1,031 

1,287 


(5.34 ± 0.07) x 10"^ 
(2.3 ±0.2) X 10-^ 
(1.9 ±0.1) X 10-^ 
(1.8 ±0.1) X 10-^ 


(7.05 ±0.08) X 10-^ 
(4.5 ±0.2) X 10-5 
(3.7 ±0.2) X 10-5 
(4.1 ±0.2) X 10-5 


(1.85 ±0.05) X 10-5 
(3.11 ±0.02) X 10-4 
(4.97 ± 0.07) X 10-4 
(5.3 ±0.1) X 10-4 


(1.46 ±0.05) X 10-5 
(1.98 ±0.01) X 10-4 
(2.56 ±0.06) X 10-4 
(2.58 ±0.06) X 10-4 


[[9,1,3]] 


1 




69 


- 


(4.90 ± 0.09) X 10-5 


(2.6 ±0.1) X 10-4 


(2.06 ±0.02) X 10-4 


[[13 
[[13 
[[13 
[[13 
[[13 


1,3]] 
1,3]] 
1,3]] 
1,3]] 
1,3]] 


3 

4 

5 

10 

15 




1,501 
1,997 
2,493 
4,973 
7,453 


(1.8 ±0.1) X 10-^ 
(1.9 ±0.1) X 10-5 


(4.3 ±0.2) X 10-5 
(3.9 ±0.2) X 10-5 
(3.9 ±0.2) X 10-5 


(1.59 ±0.04) X 10-4 
(3.81 ±0.07) X 10-4 
(4.9 ± 0.2) X 10-4 
(5.1 ± 0.2) X 10-4 
(4.9 ± 0.1) X 10-4 


(0.69 ± 0.03) X 10-4 
(1.95 ±0.04) X 10-4 
(2.30 ±0.08) X 10-4 
(2.54 ±0.07) X 10-4 
(2.63 ±0.07) X 10-4 


[[15 
[[15 
[[15 


1,3]] 
1,3]] 
1,3]] 


3 
4 

5 




2,127 
2,831 
3,535 


(1.3 ±0.2) X 10-^ 
(4.9 ±0.7) X 10-5 
(5.8 ±0.8) X 10-5 


(4.5 ±0.2) X 10-4 
(1.0 ±0.1) X 10-4 
(1.0 ±0.1) X 10-4 


(0.86 ± 0.03) X 10-4 
(1.5 ±0.6) X 10-4 
(1.8 ± 0.2) X 10-4 


(0.33 ±0.05) X 10-4 
(1.0 ±0.2) X 10-4 
(1.0 ± 0.2) X 10-4 


[[23 
[[23 
[[23 
[[23 
[[23 
[[23 
[[23 
[[23 


1,7]] 
1,7]] 
1,7]] 
1,7]] 
1,7]] 
1,7]] 
1,7]] 
1,7]] 


10 
20 
30 
40 
10 
20 
30 
40 


2 

2 
2 
2 


16,023 
32,023 
48,023 
64,023 
28,023 
56,023 
84,023 
112,023 


(1.1 ±0.3) X 10-'^ 
(1.2 ±0.4) X 10-^ 

(4±1) X 10-8 
(5±1) X 10-8 


(1.2 ±0.6) X 10-'^ 
(9 ± 4) X 10-8 

(3 ±2) X 10-8 
w<4x 10-8£ 


(1.14 ±0.05) X 10-3 
(2.33 ±0.02) X 10-3 
(2.98 ±0.04) X 10-3 
(3.33 ±0.02) X 10-3 
(5.76 ±0.09) X 10-4 
(1.23 ±0.01) X 10-3 
(1.628 ± 0.006) X 10-3 
(1.95 ±0.01) X 10-3 


(1.09 ±0.01) X 10-3 
(1.97 ±0.02) X 10-3 
(2.25 ±0.03) X 10-3 
(2.19 ±0.04) X 10-3 
(5.48 ± 0.09) X 10-4 
(1.15 ±0.01) X 10-3 
(1.487 ± 0.003) X 10-3 
(1.77 ±0.02) X 10-3 



"for [[5, 1, 3]] this pai'ameter is NB. 

'"for [[5, 1, 3]] this pai'ameter is NC. 

''for [[5, 1,3]] this parameter is the number of CNOT gates in a T3 rectangle 

''one failure in 5 x 10^ samples 



Table 3: Complete tabulation of code survey data, part 1 



[[n. 


k, 


d]] 


L 


Rl 


CX/Rec 


Pl(Pmcm = 0,po = 10 ^) 


Pl(Pmcm =P0 = 10 '') 


Pth(Pmcm = 0) 


Pth(Pmcm =Po) 


[[23 
[[23 
[[23 
[[23 


^ 


7]] 
7]] 
7]] 
7]] 


10 
20 
30 
40 


3 
3 
3 
3 


40,023 
80,023 
120,023 
160,023 


(4 ±2) X 10^*^ 


(3 ± 1) X 10^« 


(3.72 ±0.05) X 10-4 
(8.03 ±0.05) X 10-4 
(1.095 ±0.003) X 10-3 
(1.366 ±0.007) X 10-3 


(3.45 ±0.05) X 10-4 
(7.67 ±0.05) X 10-4 
(1.036 ±0.008) X 10-3 
(1.280 ±0.009) X 10-3 


25 

[[25 
[[25 
[[25 




5 

5]] 
5]] 
5]] 


4 
5 
6 

7 




1,465 
1,825 
2,185 
2,545 


- 


(1.2 ±0.7) X 10"^ 
(1.0 ±0.1) X 10"^ 
(1.08 ±0.08) X 10^6 
(1.1 ±0.2) X 10-*^ 


(8.6 ±0.2) X 10-4 
(1.13 ±0.02) X 10-3 
(1.16 ±0.02) X 10-3 
(1.17 ±0.04) X 10-3 


(7.44 ±0.05) X 10-4 
(9.74 ±0.07) X 10-4 
(1.034 ±0.008) X 10-3 
(1.01 ±0.04) X 10-3 


[[41 
[[41 
[[41 
[[41 
[[41 




5]] 
5]] 
5]] 
5]] 
5 


5 

10 

15 

20 

30 




11,321 
22,601 
33,881 
45,161 
67,721 


(7.3 ±0.8) X 10^** 
[3 X 10-7|* 


(2.39 ± 0.05) X 10-4 
[7 X 10-^] 


(1.86 ±0.02) X 10-4 
(7.44 ±0.03) X 10-4 
(1.224 ±0.003) X 10-3 
(1.577 ±0.004) X 10-3 
(2.06 ±0.01) X 10-3 


(7.9 ±0.1) X 10-^ 
(3.44 ±0.01) X 10-4 
(5.55 ±0.02) X 10-4 
(7.61 ±0.02) X 10-4 
(1.008 ±0.008) X 10-3 


[[47 
[[47 
[[47 




11]] 
11]] 
11]] 


10 
20 
30 




52,527 

105,007 

157,487 


(1.6 ±0.9) X 10"^ 


- 


(3.25 ±0.04) X 10-4 
(6.89 ±0.05) X 10-4 
(9.51 ±0.04) X 10-4 


(2.15 ±0.04) X 10-4 
(4.79 ± 0.03) X 10-4 
(6.45 ± 0.03) X 10-4 


[[49 
[[49 
[[49 
[[49 




9]] 
9]] 
9]] 
9]] 


5 

10 
15 
20 


2 
2 
2 
2 


61,549 
123,049 
184,549 
246,049 


(4±1) X 10^6 


- 


(1.02 ±0.02) X 10-4 
(3.63 ±0.08) X 10-4 
(4.0 ±0.02) X 10-4 
(4.2 ±0.3) X 10-4 


(5.4 ±0.1) X 10-^ 
(2.23 ±0.04) X 10-4 
(3.20 ± 0.08) X 10-4 


49 

[[49 
[[49 
[[49 
[[49 
[[49 
[[49 




7 

7]] 
7]] 
7]] 
7]] 
7]] 
7]] 


4 

6 

8 

9 

10 

11 

12 




2,961 
4,417 
5,873 
6,601 
7,329 
8,057 
8,785 


- 


(3.4 ±0.2) X 10"^ 
(2.8 ± 0.2) X 10"^ 
(3.3 ± 0.6) X 10"^ 
(2.2 ±0.7) X 10"^ 
(4.0 ± 0.9) X 10^^ 
(2.5 ±0.2) X 10-7 
(3 ± 2) X 10-7 


(4.73 ±0.09) X 10-4 
(1.18 ±0.01) X 10-3 
(1.41 ±0.02) X 10-3 
(1.48 ±0.02) X 10-3 
(1.42 ±0.03) X 10-3 
(1.46 ±0.03) X 10-3 
(1.46 ±0.02) X 10-3 


(3.20 ±0.02) X 10-4 
(8.7 ±0.2) X 10-4 
(1.169 ±0.005) X 10-3 
(1.224 ±0.005) X 10-3 
(1.235 ±0.005) X 10-3 
(1.241 ±0.006) X 10-3 
(1.242 ±0.006) X 10-3 


[[60 
[[60 


4 
4 


10]] 
10]] 


10 
20 




86,460 
172,860 


— 


— 


(1.129 ±0.004) X 10-4 
(3.91 ±0.02) X 10-4 


(2.20 ±0.04) X 10-4 



"for [[49, 1, 9]] this parameter is the number of preparation attempts for a 7-qubit encoded ancilla used in error detection 
''The values in square brackets are extrapolated from a linear least-squares fit to the logarithm of pi(po) 



Table 4: Complete tabulation of code survey data, part 2 



[n, k, d]] 



R 



CX/Rec 



PliPn 



0,Po = W- 



PljPmem = Po = W ^) 



Pi {Pmem = Po = iU 
(4.4 ± 0.7) X 10"^ 
[1 X 10"^]^ 

[7 X 10^^] 



Pth(Pii 



0) 



PthiPn 



Po) 



2.1 ±0.2) X 10-4 
7.1 ±0.1) X 10"4 
1.25 ±0.02) X 10" 
1.32 ±0.02) X 10' 

1.29 ±0.03) X 10" 

1.30 ±0.03) X 10" 
1.34 ±0.03) X 10" 
1.34 ±0.02) X 10" 



1.407 ±0.005) X 10-4 
4.47 ± 0.03) X 10-4 
9.57 ±0.03) X 10-4 
1.029 ± 0.004) X 10-3 
1.069 ± 0.006) X 10"3 
1.113 ±0.006) X 10"3 
1.098 ± 0.006) X 10-3 
1.112 ±0.006) X 10-3 



[[81 
[[81 
[[81 
[[81 
[[81 
[[81 
[[81 
[[81 



1,9] 
1,9] 
1,9] 
1,9] 
1,9] 
1,9] 
1,9] 
1,9] 



10 
11 
12 
18 
19 
20 



4,977 

7,425 

12,321 

13,545 

14,769 

22,113 

23,337 

24,561 



5.7 ±0.1) X 10-^ 
2.48 ± 0.01) X 10" 
4.18 ±0.05) X 10" 
5.59 ±0.04) X 10- 



2.03 ± 0.07) X 10-^ 
1.03 ±0.04) X 10-4 
1.76 ±0.02) X 10-4 
2.32 ±0.02) X 10-4 



[[85 
[[85 
[[85 
[[85 



1,7] 
1,7] 
1,7] 
1,7] 



5 

10 
15 
20 



30,405 
60,725 
91,045 
121,365 



[2 X 10" 



"The values in square brackets are extrapolated from a linear least-squares fit to the logarithm of pi(po) 
Table 5: Complete tabulation of code survey data, part 3 



[[n, k, d 



[[41,1,5]] 

[[85,1,7]] 

[[145,1,9]] 

[[221,1,11]] 

[[313,1,13]] 




Vi{P- 



Po = 10- 



(1.7 ±0.1) X 10-'' 
(5 ±2) X 10^^ 
(2±1) X 10"^ 
[8 X 10-^11 
[8 X 10-6] 



Pt\v{Pn 



Po) 



(6.8 ±0.6) X 10"^ 
(2.3 ±0.2) X 10"^ 
(4.5 ±0.2) X 10"^ 
(6.6 ±0.2) X 10-^ 
(9.0 ±0.4) X 10-^ 



"The values in square brackets are extrapolated from a linear least-squares fit to the logarithm of pi(po) 

Table 6: Surface code data using Shor-EC and a transversal CNOT as in |[T2l . taking i syndromes for an£ x i code EC. 



n, k,d]] 



family 



Pth (perfect ancilla) 



10-4 
10-4 
10-4 

X 10-3 
X 10-3 
10-3 

X 10-3 
X 10-3 
10-3 
X 10-3 
X 10-3 
10-3 
X 10-3 

X 10-2 

X 10-3 



[5,1,3]] 

[7,1,3]] 

[9,1,3]] 

[13,1,3]] 

[21,3,5]] 

[23,1,7]] 

[25,1,5]] 

[41,1,5]] 

[47,1,11] 

[49,1,7]] 

[49,1,9]] 

[60,4,10]] 

[81,1,9]] 

[85,1,7]] 

[121,1,11]] 

[145,1,9]] 

[169,1,13]] 



doubly-even dual-containing 

Bacon-Shor 

surface 

polynomial 

dual-containing 

Bacon-Shor 

surface 

doubly-even dual-containing 

Bacon-Shor 

doubly-even dual-containing 

polynomial 

Bacon-Shor 

surface 

Bacon-Shor 

surface 

Bacon-Shor 



(2.0 ±0.1) X 
(9.1 ±0.2) X 
(6.0 ±0.9) X 
(8.8 ±0.1) X 
< 10-5 
(5.34 ± 0.04) 
(1.88 ±0.04) 
(3.8 ±0.3) X 
(7.67 ±0.03) 
(2.56 ±0.05) 
(4.8 ±0.2) X 
(1.88 ±0.04) 
(2.88 ± 0.04) 
(7.5 ±0.3) X 
(2.83 ± 0.07) 
(1.01 ±0.02) 
(2.97 ±0.09) 



Table 7: Level- 1 pseudo-thresholds for rectangles using Steane-EC with perfect (noiseless) ancilla, n < 200. 
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